Bayesian ranking and selection methods using hierarchical mixture models in microarray studies.
نویسندگان
چکیده
The main purpose of microarray studies is screening to identify differentially expressed genes as candidates for further investigation. Because of limited resources in this stage, prioritizing or ranking genes is a relevant statistical task in microarray studies. In this article, we develop 3 empirical Bayes methods for gene ranking on the basis of differential expression, using hierarchical mixture models. These methods are based on (i) minimizing mean squared errors of estimation for parameters, (ii) minimizing mean squared errors of estimation for ranks of parameters, and (iii) maximizing sensitivity in selecting prespecified numbers of differential genes, with the largest effect. Our methods incorporate the mixture structures of differential and nondifferential components in empirical Bayes models to allow information borrowing across differential genes, with separation from nuisance, nondifferential genes. The accuracy of our ranking methods is compared with that of conventional methods through simulation studies. An application to a clinical study for breast cancer is provided.
منابع مشابه
A marginal mixture model for selecting differentially expressed genes across two types of tissue samples.
Bayesian hierarchical models that characterize the distributions of (transformed) gene profiles have been proven very useful and flexible in selecting differentially expressed genes across different types of tissue samples (e.g. Lo and Gottardo, 2007). However, the marginal mean and variance of these models are assumed to be the same for different gene clusters and for different tissue types. M...
متن کاملDiagnosis of Breast Cancer Subtypes using the Selection of Effective Genes from Microarray Data
Introduction: Early diagnosis of breast cancer and the identification of effective genes are important issues in the treatment and survival of the patients. Gene expression data obtained using DNA microarray in combination with machine learning algorithms can provide new and intelligent methods for diagnosis of breast cancer. Methods: Data on the expression of 9216 genes from 84 patients across...
متن کاملThe Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models
In previous studies on fitting non-linear regression models with the symmetric structure the normality is usually assumed in the analysis of data. This choice may be inappropriate when the distribution of residual terms is asymmetric. Recently, the family of scale-mixture of skew-normal distributions is the main concern of many researchers. This family includes several skewed and heavy-tailed d...
متن کاملBAYESIAN MODELS FOR DNA MICROARRAY DATA ANALYSIS A Dissertation by KYEONG
Bayesian Models for DNA Microarray Data Analysis. (May 2004) Kyeong Eun Lee, B.A., Kyungpook National University, Korea; M.A., Seoul National University, Korea Co–Chairs of Advisory Committee: Dr. Bani K. Mallick Dr. James A. Calvin Selection of significant genes via expression patterns is important in a microarray problem. Owing to small sample size and large number of variables (genes), the s...
متن کاملBias-corrected Hierarchical Bayesian Classification with a Selected Subset of High-dimensional Features
Class prediction based on high-dimensional features has received a great deal of attention in many areas. For example, biologists are interested in using microarray gene expression profiles for diagnosis or prognosis of a certain disease (eg, cancer). For computational and other reasons, it is necessary to select a subset of features before fitting a statistical model, by looking at how strongl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Biostatistics
دوره 11 2 شماره
صفحات -
تاریخ انتشار 2010